Protein secondary structure prediction using distance based classifiers
Abstract
De novo structure determination of proteins is a significant research issue in bioinformatics. Biochemical procedures for protein structure determination are costly, and pattern classification techniques have been shown to ease this task. In this article, the secondary structure prediction task is mapped to a three-class pattern classification problem, where the classes are helix, sheet, and coil. We analyze this prediction problem using three distance-based classifiers: minimum distance, K-nearest neighbor, and fuzzy K-nearest neighbor. The only information used about the proteins is the primary structure (the sequence of amino acids) itself. A new matrix-based representation of this categorical data is used to convert the sequence into real numbers. The classifiers are compared on several standard classification performance measures. The study finds that the simple minimum distance classifier performs better than the other two. © 2007 Elsevier Inc. All rights reserved.
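The three distance-based classifiers named in the abstract can be sketched as follows. This is a minimal illustration and not the authors' implementation: the two-dimensional feature vectors in `TRAIN` are invented toy values, the paper's matrix-based sequence encoding is not reproduced, and the fuzzy K-nearest neighbor function assumes the common inverse-distance weighting with crisp training labels and a fuzzifier `m`, which may differ from the variant used in the article. All function and variable names are hypothetical.

```python
import math
from collections import Counter, defaultdict

CLASSES = ("H", "E", "C")  # helix, sheet, coil

# Toy training set (assumption): (feature vector, class label) pairs.
TRAIN = [
    ((0.9, 0.1), "H"), ((0.8, 0.2), "H"),
    ((0.1, 0.9), "E"), ((0.2, 0.8), "E"),
    ((0.5, 0.5), "C"), ((0.4, 0.6), "C"),
]
QUERY = (0.85, 0.2)


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def minimum_distance_predict(train, query):
    """Assign the query to the class whose mean vector is nearest."""
    groups = defaultdict(list)
    for vec, label in train:
        groups[label].append(vec)
    centroids = {
        label: [sum(col) / len(vecs) for col in zip(*vecs)]
        for label, vecs in groups.items()
    }
    return min(centroids, key=lambda label: euclidean(centroids[label], query))


def knn_predict(train, query, k=3):
    """Majority vote among the k training vectors closest to the query."""
    neighbours = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


def fuzzy_knn_memberships(train, query, k=3, m=2.0):
    """Class memberships from inverse-distance weights over the k neighbours.

    Assumes crisp training labels; m > 1 controls how sharply the
    weight falls off with distance.
    """
    neighbours = sorted(train, key=lambda item: euclidean(item[0], query))[:k]
    weights = defaultdict(float)
    for vec, label in neighbours:
        d = max(euclidean(vec, query), 1e-12)  # guard against zero distance
        weights[label] += 1.0 / d ** (2.0 / (m - 1.0))
    total = sum(weights.values())
    return {label: weights.get(label, 0.0) / total for label in CLASSES}


if __name__ == "__main__":
    print(minimum_distance_predict(TRAIN, QUERY))
    print(knn_predict(TRAIN, QUERY, k=3))
    print(fuzzy_knn_memberships(TRAIN, QUERY, k=3))
```

The minimum distance classifier stores only one mean vector per class, so it is the cheapest of the three at prediction time. The two nearest-neighbor variants keep the whole training set, and the fuzzy variant returns graded memberships rather than a single label.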
Similar resources
Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
Voting for the Prediction of Protein Secondary Structure and Its Evaluation
Protein secondary structure prediction is one of the central topics in proteome analysis. Computational methods, developed for the prediction (classification) of protein secondary structures, have been improved substantially since 1990s, allowing us to investigate some of the computational classifiers and attempt to integrate them through voting. The study tries to evaluate whether and how much...
Protein Secondary Structure Classifiers Fusion Using OWA
The combination of classifiers has been proposed as a method to improve the accuracy achieved by a single classifier. In this study, the performances of optimistic and pessimistic ordered weighted averaging operators for protein secondary structure classifiers fusion have been investigated. Each secondary structure classifier outputs a unique structure for each input residue. We used confusion ...
Using classifier fusion techniques for protein secondary structure prediction
Classifier fusion techniques are gaining more popularity for their capability of improving the accuracy achieved by individual classifiers. A common approach is to combine the classifiers’ outcome using simple methods, such as majority voting. In this paper, we build a meta-classifier by fusing some already well-known classifiers for protein structure prediction. Each individual classifier outp...
PSP_MCSVM: brainstorming consensus prediction of protein secondary structures using two-stage multiclass support vector machines
Secondary structure prediction is a crucial task for understanding the variety of protein structures and performed biological functions. Prediction of secondary structures for new proteins using their amino acid sequences is of fundamental importance in bioinformatics. We propose a novel technique to predict protein secondary structures based on position-specific scoring matrices (PSSMs) and ph...
Journal title:
- Int. J. Approx. Reasoning
Volume 47, Issue
Pages -
Publication date: 2008